Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 528 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 53.8 KiB |
| Average record size in memory | 104.2 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 5 |
animal_category has constant value "528" | Constant |
species has constant value "528" | Constant |
killed_and_disposed_of_thsdhd is highly skewed (γ1 = 20.04431811) | Skewed |
new_outbreaks is highly skewed (γ1 = 21.95958064) | Skewed |
cases_thsdhd has 91 (17.2%) zeros | Zeros |
deaths_thsdhd has 212 (40.2%) zeros | Zeros |
killed_and_disposed_of_thsdhd has 214 (40.5%) zeros | Zeros |
new_outbreaks has 502 (95.1%) zeros | Zeros |
slaughtered_thsdhd has 416 (78.8%) zeros | Zeros |
susceptible_thsdhd has 115 (21.8%) zeros | Zeros |
vaccinated_thsdhd has 434 (82.2%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-08 16:50:54.624686 |
|---|---|
| Analysis finished | 2022-04-08 16:51:07.940368 |
| Duration | 13.32 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
year
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.25 |
|---|---|
| Minimum | 2011 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 2011 |
|---|---|
| 5-th percentile | 2011 |
| Q1 | 2013 |
| median | 2015 |
| Q3 | 2017 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.729069042 |
|---|---|
| Coefficient of variation (CV) | 0.00135420868 |
| Kurtosis | -0.8381832564 |
| Mean | 2015.25 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1788019969 |
| Sum | 1064052 |
| Variance | 7.447817837 |
| Monotocity | Increasing |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) | |
| 2017 | 68 | 12.9% | |
| 2015 | 66 | 12.5% | |
| 2016 | 65 | 12.3% | |
| 2014 | 59 | 11.2% | |
| 2013 | 54 | 10.2% | |
| 2011 | 53 | 10.0% | |
| 2012 | 51 | 9.7% | |
| 2018 | 42 | 8.0% | |
| 2020 | 29 | 5.5% | |
| 2019 | 27 | 5.1% |
| Value | Count | Frequency (%) | |
| 2011 | 53 | 10.0% | |
| 2012 | 51 | 9.7% | |
| 2013 | 54 | 10.2% | |
| 2014 | 59 | 11.2% | |
| 2015 | 66 | 12.5% |
| Value | Count | Frequency (%) | |
| 2021 | 14 | 2.7% | |
| 2020 | 29 | 5.5% | |
| 2019 | 27 | 5.1% | |
| 2018 | 42 | 8.0% | |
| 2017 | 68 | 12.9% |
country
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| China (People's Rep. of) | |
|---|---|
| Germany | |
| Italy | |
| India | |
| Poland | |
| Other values (6) |
| Value | Count | Frequency (%) | |
| China (People's Rep. of) | 109 | 20.6% | |
| Germany | 56 | 10.6% | |
| Italy | 51 | 9.7% | |
| India | 46 | 8.7% | |
| Poland | 42 | 8.0% | |
| Spain | 39 | 7.4% | |
| United Kingdom | 38 | 7.2% | |
| Brazil | 38 | 7.2% | |
| France | 38 | 7.2% | |
| Netherlands | 38 | 7.2% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 24 |
|---|---|
| Median length | 7 |
| Mean length | 11.625 |
| Min length | 5 |
disease
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| Low pathogenic avian influenza (poultry) (2006-2021) | |
|---|---|
| High pathogenicity avian influenza viruses (poultry) (Inf. with) | |
| Infectious bursal disease (Gumboro disease) | |
| Mycoplasma gallisepticum (Avian mycoplasmosis) (Inf. with) | |
| Avian infectious laryngotracheitis | |
| Other values (15) |
| Value | Count | Frequency (%) | |
| Low pathogenic avian influenza (poultry) (2006-2021) | 124 | 23.5% | |
| High pathogenicity avian influenza viruses (poultry) (Inf. with) | 110 | 20.8% | |
| Infectious bursal disease (Gumboro disease) | 44 | 8.3% | |
| Mycoplasma gallisepticum (Avian mycoplasmosis) (Inf. with) | 34 | 6.4% | |
| Avian infectious laryngotracheitis | 32 | 6.1% | |
| Avian chlamydiosis | 31 | 5.9% | |
| Fowl typhoid | 29 | 5.5% | |
| Avian infectious bronchitis | 27 | 5.1% | |
| Avian mycoplasmosis (M.synoviae) (2006-) | 26 | 4.9% | |
| Newcastle disease virus (Inf. with) | 22 | 4.2% | |
| Other values (10) | 49 | 9.3% |
Frequencies of value counts
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.6% |
Histogram of lengths of the category
Length
| Max length | 96 |
|---|---|
| Median length | 52 |
| Mean length | 43.93181818 |
| Min length | 12 |
serotype_subtype_genotype
Categorical
| Distinct | 45 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| - | |
|---|---|
| H5N8 | |
| H5N1 | |
| H5N2 | |
| H7N7 | 14 |
| Other values (40) |
| Value | Count | Frequency (%) | |
| - | 292 | 55.3% | |
| H5N8 | 41 | 7.8% | |
| H5N1 | 37 | 7.0% | |
| H5N2 | 31 | 5.9% | |
| H7N7 | 14 | 2.7% | |
| H5N3 | 13 | 2.5% | |
| H5N6 | 11 | 2.1% | |
| H5 | 11 | 2.1% | |
| H7N9 | 10 | 1.9% | |
| H7N3 | 8 | 1.5% | |
| Other values (35) | 60 | 11.4% |
Frequencies of value counts
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | 4.2% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 1 |
| Mean length | 2.695075758 |
| Min length | 1 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| Domestic |
|---|
| Value | Count | Frequency (%) | |
| Domestic | 528 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| Birds |
|---|
| Value | Count | Frequency (%) | |
| Birds | 528 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
| Distinct | 368 |
|---|---|
| Distinct (%) | 69.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118.7286534 |
|---|---|
| Minimum | 0 |
| Maximum | 6755.467 |
| Zeros | 91 |
| Zeros (%) | 17.2% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.005 |
| median | 0.9005 |
| Q3 | 38.1195 |
| 95-th percentile | 513.8006 |
| Maximum | 6755.467 |
| Range | 6755.467 |
| Interquartile range (IQR) | 38.1145 |
Descriptive statistics
| Standard deviation | 488.087932 |
|---|---|
| Coefficient of variation (CV) | 4.110953152 |
| Kurtosis | 105.7498766 |
| Mean | 118.7286534 |
| Median Absolute Deviation (MAD) | 0.9005 |
| Skewness | 9.309903519 |
| Sum | 62688.729 |
| Variance | 238229.8293 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 91 | 17.2% | |
| 0.001 | 15 | 2.8% | |
| 0.002 | 9 | 1.7% | |
| 0.004 | 8 | 1.5% | |
| 0.005 | 6 | 1.1% | |
| 0.012 | 6 | 1.1% | |
| 0.02 | 5 | 0.9% | |
| 0.01 | 5 | 0.9% | |
| 0.009 | 4 | 0.8% | |
| 0.003 | 4 | 0.8% | |
| Other values (358) | 375 | 71.0% |
| Value | Count | Frequency (%) | |
| 0 | 91 | 17.2% | |
| 0.001 | 15 | 2.8% | |
| 0.002 | 9 | 1.7% | |
| 0.003 | 4 | 0.8% | |
| 0.004 | 8 | 1.5% |
| Value | Count | Frequency (%) | |
| 6755.467 | 1 | 0.2% | |
| 5747.003 | 1 | 0.2% | |
| 3435.738 | 1 | 0.2% | |
| 3328.083 | 1 | 0.2% | |
| 2379.567 | 1 | 0.2% |
| Distinct | 283 |
|---|---|
| Distinct (%) | 53.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.396933712 |
|---|---|
| Minimum | 0 |
| Maximum | 132.684 |
| Zeros | 212 |
| Zeros (%) | 40.2% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.048 |
| Q3 | 5.6935 |
| 95-th percentile | 44.6209 |
| Maximum | 132.684 |
| Range | 132.684 |
| Interquartile range (IQR) | 5.6935 |
Descriptive statistics
| Standard deviation | 20.2450065 |
|---|---|
| Coefficient of variation (CV) | 2.410999919 |
| Kurtosis | 15.35531113 |
| Mean | 8.396933712 |
| Median Absolute Deviation (MAD) | 0.048 |
| Skewness | 3.691911105 |
| Sum | 4433.581 |
| Variance | 409.8602881 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 212 | 40.2% | |
| 0.001 | 13 | 2.5% | |
| 0.005 | 3 | 0.6% | |
| 0.015 | 3 | 0.6% | |
| 0.024 | 3 | 0.6% | |
| 0.05 | 3 | 0.6% | |
| 0.122 | 2 | 0.4% | |
| 0.003 | 2 | 0.4% | |
| 0.099 | 2 | 0.4% | |
| 0.002 | 2 | 0.4% | |
| Other values (273) | 283 | 53.6% |
| Value | Count | Frequency (%) | |
| 0 | 212 | 40.2% | |
| 0.001 | 13 | 2.5% | |
| 0.002 | 2 | 0.4% | |
| 0.003 | 2 | 0.4% | |
| 0.004 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 132.684 | 1 | 0.2% | |
| 130.367 | 1 | 0.2% | |
| 129.029 | 1 | 0.2% | |
| 125.757 | 1 | 0.2% | |
| 119.843 | 1 | 0.2% |
| Distinct | 301 |
|---|---|
| Distinct (%) | 57.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.56050189 |
|---|---|
| Minimum | 0 |
| Maximum | 15749.788 |
| Zeros | 214 |
| Zeros (%) | 40.5% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.2155 |
| Q3 | 20.9615 |
| 95-th percentile | 249.2598 |
| Maximum | 15749.788 |
| Range | 15749.788 |
| Interquartile range (IQR) | 20.9615 |
Descriptive statistics
| Standard deviation | 717.426243 |
|---|---|
| Coefficient of variation (CV) | 8.484176736 |
| Kurtosis | 433.9482518 |
| Mean | 84.56050189 |
| Median Absolute Deviation (MAD) | 0.2155 |
| Skewness | 20.04431811 |
| Sum | 44647.945 |
| Variance | 514700.4141 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 214 | 40.5% | |
| 0.001 | 5 | 0.9% | |
| 0.014 | 3 | 0.6% | |
| 0.002 | 3 | 0.6% | |
| 0.005 | 2 | 0.4% | |
| 0.032 | 2 | 0.4% | |
| 30.003 | 2 | 0.4% | |
| 0.046 | 2 | 0.4% | |
| 0.102 | 2 | 0.4% | |
| 5.267 | 2 | 0.4% | |
| Other values (291) | 291 | 55.1% |
| Value | Count | Frequency (%) | |
| 0 | 214 | 40.5% | |
| 0.001 | 5 | 0.9% | |
| 0.002 | 3 | 0.6% | |
| 0.003 | 1 | 0.2% | |
| 0.004 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 15749.788 | 1 | 0.2% | |
| 2917.495 | 1 | 0.2% | |
| 2104.285 | 1 | 0.2% | |
| 1623.136 | 1 | 0.2% | |
| 1431.452 | 1 | 0.2% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8806818182 |
|---|---|
| Minimum | 0 |
| Maximum | 320 |
| Zeros | 502 |
| Zeros (%) | 95.1% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 320 |
| Range | 320 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 14.14384512 |
|---|---|
| Coefficient of variation (CV) | 16.06010801 |
| Kurtosis | 494.597061 |
| Mean | 0.8806818182 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.95958064 |
| Sum | 465 |
| Variance | 200.0483548 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) | |
| 0 | 502 | 95.1% | |
| 1 | 11 | 2.1% | |
| 4 | 5 | 0.9% | |
| 2 | 3 | 0.6% | |
| 5 | 1 | 0.2% | |
| 7 | 1 | 0.2% | |
| 8 | 1 | 0.2% | |
| 12 | 1 | 0.2% | |
| 29 | 1 | 0.2% | |
| 47 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 502 | 95.1% | |
| 1 | 11 | 2.1% | |
| 2 | 3 | 0.6% | |
| 4 | 5 | 0.9% | |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 320 | 1 | 0.2% | |
| 47 | 1 | 0.2% | |
| 29 | 1 | 0.2% | |
| 12 | 1 | 0.2% | |
| 8 | 1 | 0.2% |
| Distinct | 112 |
|---|---|
| Distinct (%) | 21.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.64864773 |
|---|---|
| Minimum | 0 |
| Maximum | 1361.17 |
| Zeros | 416 |
| Zeros (%) | 78.8% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 35.75175 |
| Maximum | 1361.17 |
| Range | 1361.17 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 89.72647 |
|---|---|
| Coefficient of variation (CV) | 6.574019038 |
| Kurtosis | 131.490435 |
| Mean | 13.64864773 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.5788297 |
| Sum | 7206.486 |
| Variance | 8050.839418 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 416 | 78.8% | |
| 0.003 | 2 | 0.4% | |
| 0.031 | 1 | 0.2% | |
| 0.047 | 1 | 0.2% | |
| 0.029 | 1 | 0.2% | |
| 0.029 | 1 | 0.2% | |
| 1.107 | 1 | 0.2% | |
| 116 | 1 | 0.2% | |
| 0.082 | 1 | 0.2% | |
| 39.138 | 1 | 0.2% | |
| Other values (102) | 102 | 19.3% |
| Value | Count | Frequency (%) | |
| 0 | 416 | 78.8% | |
| 0.003 | 2 | 0.4% | |
| 0.004 | 1 | 0.2% | |
| 0.005 | 1 | 0.2% | |
| 0.009 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 1361.17 | 1 | 0.2% | |
| 993.561 | 1 | 0.2% | |
| 662.399 | 1 | 0.2% | |
| 481.567 | 1 | 0.2% | |
| 478.606 | 1 | 0.2% |
| Distinct | 403 |
|---|---|
| Distinct (%) | 76.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 696.7772102 |
|---|---|
| Minimum | 0 |
| Maximum | 73346.311 |
| Zeros | 115 |
| Zeros (%) | 21.8% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.1215 |
| median | 20.6495 |
| Q3 | 210.149 |
| 95-th percentile | 2908.04115 |
| Maximum | 73346.311 |
| Range | 73346.311 |
| Interquartile range (IQR) | 210.0275 |
Descriptive statistics
| Standard deviation | 3706.089959 |
|---|---|
| Coefficient of variation (CV) | 5.318902375 |
| Kurtosis | 283.6831165 |
| Mean | 696.7772102 |
| Median Absolute Deviation (MAD) | 20.6495 |
| Skewness | 15.18517536 |
| Sum | 367898.367 |
| Variance | 13735102.78 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 115 | 21.8% | |
| 0.001 | 3 | 0.6% | |
| 0.004 | 2 | 0.4% | |
| 3.018 | 2 | 0.4% | |
| 12 | 2 | 0.4% | |
| 5.267 | 2 | 0.4% | |
| 31.985 | 2 | 0.4% | |
| 3 | 2 | 0.4% | |
| 0.13 | 2 | 0.4% | |
| 0.5 | 2 | 0.4% | |
| Other values (393) | 394 | 74.6% |
| Value | Count | Frequency (%) | |
| 0 | 115 | 21.8% | |
| 0.001 | 3 | 0.6% | |
| 0.004 | 2 | 0.4% | |
| 0.006 | 1 | 0.2% | |
| 0.011 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 73346.311 | 1 | 0.2% | |
| 17750.013 | 1 | 0.2% | |
| 16628.557 | 1 | 0.2% | |
| 15476.535 | 1 | 0.2% | |
| 15309.175 | 1 | 0.2% |
| Distinct | 95 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 72.410875 |
|---|---|
| Minimum | 0 |
| Maximum | 3423.02 |
| Zeros | 434 |
| Zeros (%) | 82.2% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 473.61095 |
| Maximum | 3423.02 |
| Range | 3423.02 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 298.8545384 |
|---|---|
| Coefficient of variation (CV) | 4.12720518 |
| Kurtosis | 53.97638818 |
| Mean | 72.410875 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.532894762 |
| Sum | 38232.942 |
| Variance | 89314.03511 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 434 | 82.2% | |
| 983.361 | 1 | 0.2% | |
| 40.099 | 1 | 0.2% | |
| 877.358 | 1 | 0.2% | |
| 11.846 | 1 | 0.2% | |
| 314.766 | 1 | 0.2% | |
| 113.7 | 1 | 0.2% | |
| 1021.947 | 1 | 0.2% | |
| 8.467 | 1 | 0.2% | |
| 433.84 | 1 | 0.2% | |
| Other values (85) | 85 | 16.1% |
| Value | Count | Frequency (%) | |
| 0 | 434 | 82.2% | |
| 0.015 | 1 | 0.2% | |
| 0.2 | 1 | 0.2% | |
| 0.202 | 1 | 0.2% | |
| 0.465 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 3423.02 | 1 | 0.2% | |
| 2936.485 | 1 | 0.2% | |
| 2149.913 | 1 | 0.2% | |
| 1552.688 | 1 | 0.2% | |
| 1532.873 | 1 | 0.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| year | country | disease | serotype_subtype_genotype | animal_category | species | cases_thsdhd | deaths_thsdhd | killed_and_disposed_of_thsdhd | new_outbreaks | slaughtered_thsdhd | susceptible_thsdhd | vaccinated_thsdhd | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2011 | Brazil | Avian infectious bronchitis | - | Domestic | Birds | 556.254 | 38.576 | 1.530 | 0 | 0.023 | 1335.599 | 0.000 |
| 1 | 2011 | Brazil | Avian mycoplasmosis (M.synoviae) (2006-) | - | Domestic | Birds | 2379.567 | 1.891 | 1.940 | 0 | 0.840 | 3071.680 | 0.000 |
| 2 | 2011 | Brazil | Fowl cholera (-2011) | - | Domestic | Birds | 12.263 | 4.577 | 0.215 | 0 | 0.470 | 249.082 | 0.000 |
| 3 | 2011 | Brazil | Infectious bursal disease (Gumboro disease) | - | Domestic | Birds | 513.304 | 11.917 | 0.000 | 0 | 0.000 | 593.124 | 0.000 |
| 4 | 2011 | Brazil | Mycoplasma gallisepticum (Avian mycoplasmosis) (Inf. with) | - | Domestic | Birds | 214.844 | 3.448 | 0.001 | 0 | 0.009 | 262.024 | 0.000 |
| 5 | 2011 | China (People's Rep. of) | Avian infectious bronchitis | - | Domestic | Birds | 675.617 | 61.292 | 30.884 | 0 | 5.011 | 6144.577 | 2936.485 |
| 6 | 2011 | China (People's Rep. of) | Avian infectious laryngotracheitis | - | Domestic | Birds | 297.014 | 21.036 | 6.128 | 0 | 0.951 | 2944.032 | 1259.010 |
| 7 | 2011 | China (People's Rep. of) | Duck virus hepatitis | - | Domestic | Birds | 150.113 | 47.873 | 20.781 | 0 | 7.426 | 912.188 | 324.633 |
| 8 | 2011 | China (People's Rep. of) | Fowl cholera (-2011) | - | Domestic | Birds | 350.782 | 72.470 | 43.432 | 0 | 3.624 | 6265.897 | 1552.688 |
| 9 | 2011 | China (People's Rep. of) | Fowl typhoid | - | Domestic | Birds | 3.308 | 0.953 | 0.080 | 0 | 0.045 | 65.342 | 3.826 |
Last rows
| year | country | disease | serotype_subtype_genotype | animal_category | species | cases_thsdhd | deaths_thsdhd | killed_and_disposed_of_thsdhd | new_outbreaks | slaughtered_thsdhd | susceptible_thsdhd | vaccinated_thsdhd | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 518 | 2021 | India | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N8 | Domestic | Birds | 23.120 | 16.599 | 58.107 | 4 | 0.0 | 170.288 | 0.0 |
| 519 | 2021 | Italy | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N1 | Domestic | Birds | 0.010 | 0.000 | 0.000 | 1 | 0.0 | 8.960 | 0.0 |
| 520 | 2021 | Italy | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N8 | Domestic | Birds | 0.078 | 0.059 | 0.037 | 0 | 0.0 | 0.096 | 0.0 |
| 521 | 2021 | Italy | Low pathogenic avian influenza (poultry) (2006-2021) | H5N7 | Domestic | Birds | 0.000 | 0.000 | 0.000 | 0 | 0.0 | 3.018 | 0.0 |
| 522 | 2021 | Italy | Low pathogenic avian influenza (poultry) (2006-2021) | H7N7 | Domestic | Birds | 0.034 | 0.033 | 3.068 | 0 | 0.0 | 3.018 | 0.0 |
| 523 | 2021 | Netherlands | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N8 | Domestic | Birds | 0.600 | 0.600 | 64.575 | 2 | 0.0 | 65.175 | 0.0 |
| 524 | 2021 | Poland | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N8 | Domestic | Birds | 19.335 | 3.438 | 24.267 | 2 | 0.0 | 27.705 | 0.0 |
| 525 | 2021 | United Kingdom | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N1 | Domestic | Birds | 0.103 | 0.099 | 3.105 | 1 | 0.0 | 3.204 | 0.0 |
| 526 | 2021 | United Kingdom | High pathogenicity avian influenza viruses (poultry) (Inf. with) | H5N8 | Domestic | Birds | 0.000 | 0.000 | 0.000 | 1 | 0.0 | 0.001 | 0.0 |
| 527 | 2021 | United Kingdom | Influenza A viruses of high pathogenicity (Inf. with) (non-poultry including wild birds) (2017-) | H5N1 | Domestic | Birds | 0.012 | 0.011 | 0.001 | 1 | 0.0 | 0.012 | 0.0 |